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REMARKS/ARGUMENTS 

In paragraph 2 of the Office action, the examiner states that the Information Disclosure 
Statement filed 10/20/2003 did not contain the required legible copy of each non-patent literature 
publication. In response, a copy of form PTO/SB/08B and copies of the two publications listed 
on that form are filed herewith. 

In paragraph 3 of the Office action, the examiner states that serial numbers and dates are 
missing from paragraph [0044] of the specification. Appropriate amendments have been made to 
paragraph [0044], 

Double Patenting 

In paragraph 5 of the Office action, claims 1-20 stand provisionally rejected on the 
grounds of nonstatutory, obviousness-type, double patenting as being unpatentable over claims 
1-16 and 18-21 of copending Application No. 10/689,355. Because this is a provisional 
rejection, applicant will address this rejection when allowable subject matter is indicated. 

In paragraph 8 of the Office action, claims 1-8, 12-15, and 20 stand provisionally rejected 
on the grounds of nonstatutory, obviousness-type, double patenting as being unpatentable over 
claims 1-9, 11-17, and 20 of copending Application No. 10/689,312 (the '312 application). 
Applicant respectfully traverses that rejection for the following reasons. 

Applicant respectfully disagrees with the examiner's characterization of the relationship 
between the pending claims and the claims of the 6 3 12 application. Specifically, the examiner 
asserts that "the only difference is the substitution of 'determining a sum deviation' in the 
present application with the step of 'determining a running partial deviation sum' in copending 
Application No. 10/689,312." In claim 1 of the '312 application, the local deviation is calculated 
from the local mean number and a running partial deviation sum is determine for each of the 
plurality of processing elements. In contrast, in the present application, claim 1 recites 
determining a sum deviation from said local deviations for one-half of said loop in an anti- 
clockwise direction and determining a sum deviation from said local deviations for one-half of 
said loop in a clockwise direction. A clockwise transfer parameter and an anti-clockwise transfer 
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parameter are determined from the sum deviations. Applicant asserts that the determination of 
sum deviations for one-half of the loop in a clockwise direction and sum deviations for one-half 
of the loop in an anti-clockwise direction in place of determining a running partial deviation sum 
would not have been obvious to a person of ordinary skill in the art. The alleged motivation of 
"the desire to have a greater variety of choices when performing the claimed load balancing 
method" does not provide a legal analysis justifying the conclusion that the claims of the instant 
application are obvious in view of the claims of the 6 3 12 application. For the foregoing reason, 
applicant respectfully requests that the double patenting rejection of claim 1-8, 12-15, and 20 on 
the basis of copending Application No. 10/689,312 be withdrawn. 

35 U.S.C. § 101 

In paragraphs 1 1 and 12 of the Office action, claim 20 stands rejected under 35 U.S.C. 
§ 101 for reciting "a memory device." In response, claim 20 has been amended to recite "a 
computer readable memory device." Claims to a "computer readable medium" are authorized in 
the Interim Guidelines for Subject Matter Eligibility, in the section dealing with "practical 
application." It is believed that claim 20, as amended, is in compliance with the interim 
guidelines such that the 35 U.S.C. § 101 rejection should be withdrawn. 

35 U.S.C S 112 

In paragraph 14 of the Office action, claims 1-20 stand rejected under 35 U.S.C. § 1 12, 
second paragraph, as being indefinite for failing to particularly point out and distinctly claim the 
subject matter which applicant regards as the invention. 

In paragraph ai, the examiner states that "calculating a local number of tasks within each 
of said plurality of processing elements" is unclear. Applicant respectfully disagrees. The 
claims are read in light of the specification and the specification discloses at least one method of 
calculating a local mean. See the published application beginning with paragraph [0043]. 
Applicant asserts that one of ordinary skill in the art would know how to calculate a local mean 
based on the disclosure in the specification. Applicant should not be required to write a 
preferred embodiment into the claims. Further, a very similar limitation appears in claim 1 of 
copending application 10/689,312 which has been examined by this same examiner without 
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objection to that language. Applicant respectfully requests that the examiner reconsider this 
rejection. 

In paragraph ai, the examiner next states that in line 1 1 it is unclear whether the local 
deviation determining step is performed based on the preceding step. Claims 1 and 20 have been 
amended to make it clear that the local deviation is calculated from the local mean number. See 
paragraph [0049] of the published application. Claim 1 1 already has such language. 

In paragraph ai, the examiner next states that in lines 12-17 it is unclear how a sum 
deviation is defined. Claims 1,12, and 20 have been amended to recite that the sum deviations 
are determined from the local deviations. See the published application beginning with 
paragraph [0050]. 

In paragraph ai, the examiner next states that in lines 18-19 it is unclear whether the 
transfer parameters are based on the results of the preceding steps. Appropriate language tying 
the steps together has been added to claims 1,12, and 20. See the published application 
beginning at paragraph [0053]. 

In paragraph aii, it is the examiner's position that it is unclear what is meant by the "V" in 

claims 4 and 14. Each of claims 4 and 14 has been amended to recite that "P' is the total number 

of tasks. The examiner also indicates, with respect to "E", that it is unclear how that value is 

determined for each of the plurality of processing elements. The examiner's attention is 

respectfully directed to paragraph [0046] of the published application which provides: 

The rounding function M r = Trunc((V + E r )l N) prevents tasks 
from being lost or gained (where M r represents the local mean for 
PE r , N represents the total number of PEs 30 in the loop 50, and E r 
represents a number in the range of 0 to (N-l)). In the current 
embodiment, each PE is assigned a different E r value for 
controlling the rounding. The simplest form for the function E is 
the case in which E r = P r , where P r represents the PEs position 
round the loop. For example, for PE 0 , E 0 = 0; for PEi, E| = 1; for 
PE 2 , E 2 = 2; etc. By assigning each PE in the loop a different E r 
value, the rounding function can be controlled such that some of 
the local means are rounded up and some of the local means are 

i=N-\ 

rounded down, thus insuring that V = M x . It should be noted 

/=0 

that in the current embodiment, the local mean for each PE 30 in 
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the loop is computed in parallel with the local means of the other 
PEs in the loop. 

It is submitted that reading claims 4 and 14 in view of the disclosure of paragraph [0046], 
one of ordinary skill in the art would understand how the value E r is derived for each of the 
plurality of processing elements. Finally, with respect to claim 14, a definition has been 
provided for PE r . 

In paragraph aiii, the examiner indicates that it is unclear in claims 5 and 15 how E r 
"controls" the Trunc function. The language of claim 5 and claim 15 has been amended to recite 
that the Trunc function is responsive to the value of E r . With respect to the examiner's question 
about how this step is possible, "since each E r value is set ahead of time and must be different for 
each processing element," the examiner's attention is respectfully directed to paragraph [0046] 
reproduced above. 

With respect to paragraph aiv, the examiner states that the recitation of "X and (X+l)" is 

unclear. The examiner's attention is respectfully directed to paragraph [0014] of the published 

application which provides as follows: 

The present invention enables tasks to be distributed along a group 
of serially connected PEs so that each PE typically has X number 
of tasks or (X+l) number of tasks to perform in the next phase. 
The present invention may be performed using the hardware and 
software (i.e., the local processing capability) of each PE within 
the array. Those advantages and benefits, and others, will become 
apparent from description of the invention below. 

The examiner's attention is also directed to the table appearing in paragraph [0048] of the 
published application which provides: 
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Table #1 - Local Mean Calculation for the Loop 50 (V= 43, N = 8). 



The language of claim 6 has been amended to indicate that a local mean for each group is 
equal to either X, or X+l , as seen clearly from Table No. 1 where X = 5 and X+l = 6. A 
definition is provided for E r . 

In paragraph av, the examiner states that it is unclear in claim 10 whether "S," which 
represents the deviation of a selected processing element, is the same as the local deviation in 
claim 1. Claim 10 has been amended to eliminate any ambiguity. 

In; paragraph avi, the examiner states that it is unclear in claim 1 1 what is meant by the 
variables A, Mag, T a and T c . Claim 1 1 has been amended to make it clear that T a and T c 
represent the anti-clockwise and clockwise transfer parameters and to clarify the relationship 
between the parameters. See paragraphs [0058] and [0059] of the published application. 

In paragraph avii, the examiner states that it is unclear in claim 12 how the insertion of 
the phantom element relates to the rest of the recited steps. Claim 12 has been amended to 
indicate that the phantom element is inserted when there is an odd number of processing 
elements in the loop. The phantom element is assigned a local deviation of zero. Support for 
this amendment can be found in paragraph [0057] of the published application and original 
claim 16. 

In view of the foregoing, it is respectfully requested that the rejection of claims 1-20 
under 35U.S.C.§112, second paragraph, be withdrawn. 

35 U.S.C. $103 

In paragraph 16 of the Office action, claims 1-20 stand rejected under 35 U.S.C. § 103(a) 
as being unpatentable over Smith (U.S. Pub. No. 2004/0024874) in view of Wheat (U.S. Patent 
No. 5,630,129). Applicant respectfully traverses that rejection. 

It is the examiner's position that both Smith and Wheat teach methods of load balancing. 
Although that assertion is correct, the methods of load balancing taught by Smith and Wheat are 
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so different from one another, and different from what is claimed, that it is not possible for the 
combination of those two references to suggest the claimed invention. 

The examiner asserts that Smith teaches "calculating a local deviation for each of said 
plurality of processing elements." Claims 1 and 20 have been amended to recite that the local 
deviation is calculated "from said local mean number." Claim 12, as originally presented, 
contained similar subject matter. Smith does not teach calculating a local deviation from said 
local mean number because, as the examiner recognizes, Smith does not teach calculating a local 
mean number. Accordingly, the "calculating a local deviation from said local mean number" is 
not taught by Smith. 

The examiner next asserts that Smith discloses "determining a clockwise transfer 

parameter and an anti-clockwise transfer parameter for each of said plurality of processing 

elements" citing paragraphs 1 8-20 of Smith. Smith does not calculate a transfer parameter as 

asserted by the examiner. Smith merely compares the workloads of pairs of processors, and the 

processor having the lower level of work simply requests work from the processor having a 

higher level of work. There is no transfer parameter calculated. This is made clear in Smith, 

paragraph [0038], which recites in part: 

With a uni-directional link from processor A 13 ("upstream") to 
processor B 14 ("downstream"), A informs B of how much 
workload it has, B then compares this with its own level of 
workload, and if B is less loaded than A, then it requests work 
from A. It is therefore ensured that B has at least as much work as 
A. Such pairs are linked end to end in a chain, with all the links 
going in the same direction, with the ends of the chain joined 
together. This forms a closed loop with all the workload transfers 
travelling in the same direction. Since in each pair the one 
downstream of the link has at least as much work as the one 
upstream, and every processor in every pair downstream of another 
processor, it ensures that the entire ring is inherently balanced. 

Finally, the examiner asserts that Smith teaches "redistributing tasks among said plurality 
of processing elements in response to said clockwise transfer parameter and said anti-clockwise 
parameter for each of said plurality of processing elements" citing paragraph [0020] of Smith. 
As discussed above, Smith does not operate in such a manner so as to generate transfer 
parameters. Accordingly, the redistribution does not take place in response to transfer 
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parameters. Redistribution takes place in Smith in response to a request from one processing 
element, having a lighter workload, made to another processing element having a heavier 
workload. 

The examiner acknowledges that Smith does not teach "determining a total number of 
tasks present within said loop; calculating a local mean number of tasks for each of said plurality 
of processing elements; [or] determining a sum deviation within each for each of said plurality of 
processing elements for one-half of said loop in a clockwise and an anti-clockwise direction." 
From that admission and the previous arguments, it is thus seen that Smith, although it does deal 
with load balancing, operates in such a completely different manner that it discloses none of the 
steps of claims 1,12, and 20. 

The defects of the primary reference to Smith are not overcome by Wheat. The examiner 

asserts in paragraph 20 that "Wheat teaches a dynamic load balancing method by determining the 

average load across a processor array and minimizing a global imbalance or [sic of] workloads 

within a finite number of balancing steps" citing column 6, lines 58-67. Although the cited 

portion of Smith does discuss a "global imbalance," this is part of a discussion in which Wheat 

proves that his method minimizes the global imbalance. The actual method is set forth beginning 

in column 5, line 50, with a determination of workloads. Workloads are then compared amongst 

processors. See column 5, lines 60-67, which provide: 

Each processor compares its work load to the work load of the 
other processors in its neighborhood and determines which 
processors have greater work loads than its own. If any are found, 
it selects the one with the greatest work load (ties are broken 
arbitrarily) and sends a request for work to that processor. Each 
processor may send only one work request, but a single processor 
may receive several work requests. 

Transfers take place according to priorities as discussed in column 6, lines 40-57, which 

provide as follows: 

FIG. 4 illustrates an example of element priorities and selection for 
exporting four elements to the east neighboring processor. Initially, 
elements 3, 6, 9, and 12 are eligible for export. Their priorities are 
computed; element 3, for example, has priority -2, since it has two 
local neighbors (-2), one neighbor in a concerned partner processor 
(-2), and one neighbor in the importing processor (-K2). Elements 6 
and 9 share the highest priority, but since element 6 has a greater 
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work load, it is selected. Element 5 becomes eligible for export, 
but its priority is low since it has three local neighbors. The 
priorities are adjusted, and element 9 is selected, making element 8 
a candidate. The priorities are again updated, and the selection 
process continues with elements 3 and then 12 being selected. 
Although the work request is not completely satisfied, no other 
elements are exported, as the work loads of the elements with the 
highest priority, 5 and 8, are greater than the remaining work 
request 

It is seen that Wheat, although disclosing a method for dynamic load balancing, teaches a 
very different method than either the claimed invention or Smith. Processor work requests are 
determined based on processors comparing their workloads with other processors. Requests are 
then made and granted on the basis of priorities. There is no calculating a local mean number of 
tasks, calculating a local deviation, using the local deviations to determine sum deviations for 
one-half the loop in an anti-clockwise direction and for one-half the loop in a clockwise 
direction, determining clockwise and anti-clockwise transfer parameters from the sum 
deviations, or redistributing tasks based on the transfer parameters. It is respectfully submitted 
that the load balancing techniques of Smith and Wheat are so dissimilar from one another, and so 
dissimilar from the claimed invention, that no possible combination of the teachings of the two 
references renders obvious claims 1-20. For the foregoing reasons, applicant respectfully 
requests that the 35 U.S.C. § 103 rejection of independent claim 1 based on the combination of 
Smith and Wheat be withdrawn. 

With respect to paragraphs 25 and 26 of the Office action and claim 4, the Office relies 
upon official notice for the proposition that "it is well known to perform a local mean calculation 
using this method and using a truncation function to remove unnecessary decimals." While such 
a function may be well known in the art, the methods of Wheat and Smith are complete in 
themselves. The effort to graft an unnecessary step onto the methods of Wheat and Smith 
through the use of official notice is nothing more than an improper hindsight reconstruction. The 
rejection of claim 4 should be withdrawn. 

With respect to paragraphs 27 and 28 and claim 5, the Office provides no basis for the 
conclusion that "it would have been obvious ... to change the value of E r " The examiner 
provides no basis for where that teaching is found, no citation to the art of record, and no reliance 
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upon official notice. The Office appears to be using applicant's disclosure as a basis for a 
hindsight reconstruction of claim 5. 

With respect to paragraphs 29 and 30 of the Office action and claim 6, neither Smith nor 
Wheat teaches calculating a local mean. Thus, it is difficult to understand how the references 
can disclose the details of claim 6 when the broad concept of calculating a local mean is not even 
disclosed. 

With respect to paragraphs 3 1 and 32 of the Office action and claim 7, because neither 
Smith nor Wheat discloses determining a local mean, one of ordinary skill in the art would not be 
interested in calculating a local deviation, by any means. The Office appears to be using 
applicant's disclosure as a basis for a hindsight reconstruction of the claim. 

With respect to paragraphs 33-34 of the Office action and claims 8 and 9, the examiner 
provides no basis for where the missing teachings are found; there is no citation to the art of 
record and no reliance upon official notice. The Office appears to be using applicant's disclosure 
as a basis for a hindsight reconstruction of claims 8 and 9. 

With respect to paragraphs 35-38 of the Office action and claims 10 and 1 1, neither 
Smith nor Wheat discloses determining clockwise and anti-clockwise transfer parameters. Given 
that recognition, it is difficult to understand how the references can disclose the details of claims 
10 and 1 1 when the broad concept of transfer parameters is not even disclosed. 

The same arguments presented above are applicable to claims 12-18 and 19-20. 

Request for Interview 

Applicant has made a diligent effort to place the instant application in condition for 
allowance. If the examiner is of the opinion that the instant amendment does not place the 
currently pending claims in condition for allowance with respect to the art of record, the 
examiner is respectfully requested to contact applicant's attorney at the telephone number listed 
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below so that an interview may be scheduled before the issuance of a final Office action 
rejecting the claims on the basis of the art currently of record. 



Respectfully submitted, 




Edward L. Pencoske 

Reg. No. 29,688 

Jones Day 

One Mellon Center 

500 Grant Street, Suite 3100 

Pittsburgh, PA, USA, 15219 

(412)394-9531 

(412) 394-7959 (Fax) 

Attorneys for Applicant 
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